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We prove tight bounds on the relaxation time of the so-called 
L-reversal chain, which was introduced by R. Durrett as a stochastic 
model for the evolution of chromosome chains. The process is de- 
, scribed as follows. We have n distinct letters on the vertices of the 

' n-cycle {Z mod n); at each step, a connected subset of the graph is 

chosen uniformly at random among all those of length at most L, 
and the current permutation is shuffled by reversing the order of the 
letters over that subset. We show that the relaxation time T(n,L), 
defined as the inverse of the spectral gap of the associated Markov 
generator, satisfies T{n,L) = 0(n V j^)- Our results can be inter- 
^ preted as strong evidence for a conjecture of R. Durrett predicting a 

^ . similar behavior for the mixing time of the chain. 

0^ 

\ 1. Introduction, models and results. In a series of recent papers, R. Dur- 

■ rett has proposed stochastic models based on shuffling rules for the analysis 

of the evolution of chromosomes [6, 7, 8]. One of the main issues is that of 
' determining the time needed for such processes to reach stationarity. Besides 

. proving several results in this direction and discussing possible applications, 

1 ^ \ he pointed out a number of interesting conjectures. One of them refers to 

' the so-called L-reversal model, which is described as follows: 

We consider a chain (the chromosome) of n distinct genes, so that the 
configuration of the system at any given time is a permutation rj oi n let- 
ters over n vertices. The vertices are lying on a circle and we take the 
graph structure of the n-cycle (where each vertex is connected to exactly 
' two vertices). Given an integer L <n, one step of the L-reversal process is 

described as follows. Uniformly at random we pick a vertex x and, indepen- 
dently, choose a number 1 <i < L. Given x and I, we perform the transition 
rj fj^/^ where r]^'^ is obtained from 7] by reversing the order of the letters 
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in the segment {x,x + 1, . . . ,x + i}, where the sums are taken modulo re. 
Note that the probabihty of the transition rj — > r]^'^ coincides with that of 
the transition r/^'^ — > rj, and, therefore, we have reversibihty with respect to 
the uniform distribution v over all n! configurations. As in [6], we consider 
the continuous-time version of the L-reversal dynamics, in which the steps 
described above are performed at the arrival times of an independent rate 
1 Poisson process. This defines a continuous-time Markov chain converging 
to the uniform distribution u. 

Let T(n, L) denote the mixing time of the chain, that is, 

(1) r(n, L) = inf{t > : \\pt - z^||tv < 1/e}, 

where || • ||tv stands for the total variation distance, and pt denotes the 
distribution of the configuration at time t for the process started from some 
arbitrary (i.e., completely ordered) configuration t] (note that \\pt — i^Wty 
does not depend on the chosen rj, because v is the uniform measure). 

The striking conjecture in [6] (see also Problem 4.1 in [8]) is that the 
mixing time T(re,L) of the L-reversal process should satisfy 

1 / fj3\ / re'^\ 

(2) _ U V j log n < T{n, L) < C ( re V j logre, 

for some universal constant C < oo. Here, for numbers a,b, we use a V 6 
to denote the maximum max{a,b} . Note that, when L = 1, we have the 
usual transposition dynamics on the re-cycle, which has been studied in great 
detail; see, for example, [4, 5, 9, 10, 11]. In particular, the conjecture is known 
to hold in this case. Of most interest are the cases L = re°, a S (0, 1). 

To support the conjectured bounds in (2), Durrett uses an adaptation of 
Wilson's method to estimate mixing times [11]. In this way, by a careful 
choice of the slow modes of relaxation, he proves the lower estimates in (2). 
As for the upper estimates, by comparison with random transposition, he 
proves that 

(3) r(re,L)<C-^logre. 

When L = re", a G (0, 1), the estimate (3) is off by a factor of order re" for 
a < 2/3 and n^^^"") for a > 2/3 . This shows that the comparison with 
transpositions is efficient only in the two extreme regimes a G {0, 1}. We are 
not able to prove (or disprove) the desired upper bound in (2), but we have 
considered the problem of establishing similar estimates for the relaxation 
time T{n,L), that is, the inverse of the spectral gap of the chain. What we 
find confirms the predicted behavior, at least at the level of the spectral gap. 

To describe our main result, we introduce some notation. For any function 
/ of the configurations, we write i>[f] = ^ J^rifiv) fo^' its expectation and 
Var(/) = z^[/^] — i^if]'^ for its variance w.r.t. the uniform measure u. Also, we 
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write for the function r] — > /(?7^' ) and Rx,ef = f for the associated 

gradient. The Dirichlet form of the L-reversal dynamics introduced above is 

The relaxation time is then given by 

fK\ f T\ Var(/) 

(5) r(ra,L) = sup ^^^ , 

with the supremum taken over all nonconstant functions /. 
Our main result reads as follows: 

Theorem 1.1. There exists C < oo such that, for any n>2 and any 
l<L<n, 

(6) ^(^nV^^<T{n,L)<c(nV^y 

As usual, the lower estimate in (6) can be obtained by a suitable choice 
of test functions; see Section 3 for details. To obtain a sharp upper bound 
on r(n, L), the argument is more delicate. Here we combine new comparison 
arguments with an improved iteration scheme inspired by recent work of 
Carlen, Carvalho and Loss [3]. We refer to the beginning of next section 
for a brief description of the main idea of the proof. Our method allows 
the derivation of sharp estimates for more general models in the class of 
so-called p-reversal processes, where each inversion 77 — > r/^'^ is performed 
with probability p{i), p being a probability vector on {l,...,n}. The L- 
reversal process (4) is then obtained for p{i) = j^, £ = 1, . . . , L. The so-called 
^-reversal process is defined by p{i) oaO^, 9 € (0, 1), with Dirichlet form 

(7) Seif, f) = E E 0'^'u[iRx,dn 

x=ie=i 

In this case, calling r(?i, 9) the relaxation time associated to (7) we obtain 

Theorem 1.2. There exists C < 00 such that, for any n>2 and any 
0G(O,1), 

(8) ^(n V [n(l - 9)f) < T{n, 6) < C{n V [n(l - 9)f). 

We conclude this introduction with a final remark on Durrett's conjec- 
ture (2). It is well known that T{n,L) > T{n, L)/C . On the other hand, an 
application of standard bounds (see, e.g., Theorem 2.1.7 in [10]) only gives 

T(n,L) <CT(n,L)log(n!). 
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Better estimates can be obtained if one considers the decay of relative en- 
tropy functionals rather than L^-norms. Namely, let P{n,L) denote the en- 
tropy dissipation constant given by 

X Ent(/) 
/>o '?(/,log/) 

with the supremum taken over all positive nonconstant functions. Here we 
use the standard notation Ent(/) = i/[/log/] — logi/[/] for the entropy 
of /. It is not hard (see, e.g.. Corollary 2.8 in [1]) to obtain the bound 

r(n,L) <C/3(n,L)loglog(n!). 

Therefore the conjecture (2) would follow if we could prove the bounds 
(6) with r replaced by (3. Despite recent progress in the understanding of 
entropy dissipation in various reversible Markov chains, the extension of our 
techniques to this case remains a challenging problem. 

It is perhaps interesting to observe that the predicted mixing time cannot 
be derived from logarithmic Sobolev inequalities here. Indeed, we will show 
that if s(n,L) denotes the log-Sobolev constant [obtained as in (9) with 
£{Vf,Vf) replacing <S(/,log/)], then 

(10) s(n,L)>— . 

Comparing this with the bounds in Theorem 1.1, for L = n", the bound (10) 
shows that s{n,L) > T{n, L)n?'^~^ when 1/2 < a < 2/3 and s{n,L) > 
T{n,L)n^~°' when a > 2/3. While comparison with random transposition 
can always be used to prove an upper bound of the form s{n,L) < 
C(n^/L^)logn (see, e.g.. Theorem 6 in [6]), it remains an open question 
to understand the true asymptotic behavior in L,n of the log-Sobolev con- 
stant. 

The rest of this paper consists of two sections. In the first, we briefly 
outline the proof of the upper bound on the relaxation time, develop all the 
technical parts of our approach and finally prove the corresponding estimates 
of Theorem 1.1 and Theorem 1.2. In the final section, we prove the matching 
lower bounds and derive the estimate (10) on the log-Sobolev constant. 

2. Upper bounds. The main idea needed for the proof of the upper 
bounds in Theorems 1.1 and 1.2 is to compare the L-reversal chain with 
(an average over i < L of) a suitable "block dynamics" with blocks of size i 
that we call the block-average dynamics. Such an auxiliary process seems to 
be at the heart of several shuffling chains, and it can be described informally 
as follows (see Section 2.3 below for the formal definition): 

Assume for simplicity that n = Ni for some integers N, £ and partition the 
vertices {l,2,...,n} into N nonoverlapping blocks {Ij} of length £. Then, 
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with rate a pair of blocks is chosen, say li and Ij, and the letters 

inside /j U Ij are uniformly reshuffled over the vertices of U Ij . We refer to 
this move as an "average" of the two blocks and Ij. A first key step then 
consists in proving that the relaxation time of the above block dynamics is 
0(1) uniformly in N,i. We are actually able to compute exactly the spectral 
gap of this Markov chain (see Proposition 2.3 below). This computation is 
carried out via an adaptation and extension of the technique introduced by 
Carlen, Carvalho and Loss [3] for so-called Kac systems. 

The comparison of the above-described block-average dynamics to the 
L-reversal chain is then accomplished in two further steps: 

In the first step, we compare the block-average dynamics to an interme- 
diate block dynamics in which only adjacent blocks are averaged but we 
also allow an "exchange" between two adjacent blocks. We say that blocks 
are exchanged if the configuration of letters inside /j is interchanged 
with that inside Ij+i. We shall call this intermediate dynamics the local 
average-exchange block dynamics. This step of the comparison uses rather 
standard path techniques, but the crucial point is that the usual diffusive 
scaling factor N'^ appears only in front of the exchange operations and does 
not multiply the local average moves (see Lemma 2.5). 

In the second step, we compare the local average-exchange block dynam- 
ics to the L-reversal chain. The key to this step is the observation that moves 
of the L-reversal process can efficiently simulate both block exchanges and 
block averages. In fact, the local exchange move is easily expressed in terms 
of (three) suitable ^-reversal moves Rx/ (see Lemma 2.6). Also, the average 
of a pair of adjacent blocks /j and /j+i can be compared to inversions, using 
the Poincare inequality for random transpositions on the complete graph 
generated by vertices in /j U /j+i. Finally, we use a straightforward compari- 
son between the latter and a 2^-reversal chain on /j U /j+i (see Lemma 2.7). 
This does not spoil the estimate because, as discussed in the Introduction, 
the random transpositions chain and the L-reversal chain can be efficiently 
compared in the extreme case L = n (i.e., a = 1). 

2.1. Setting and notation. The setting and frequently-used notation are 
described as follows. Vn is the set of n ordered vertices {1,2, . . . Unless 
otherwise stated, the sum x + y \s assumed to be taken modulo n, for any 
x,y € Vn- A permutation (often called a configuration) of n letters over V„ is 
denoted by r], with rjx = j meaning that we have letter j at vertex x G V„. We 
write for the space of all n! configurations, is the uniform distribution 
on fi. For any integer m < n, / C is called an m- block if I is of the form 
{x -\- l,x + 2, . . . ,x + m} for some x GVn. Given e N, we write rj]^^rn for the 
configuration over the m-block {[k — l)m + 1, (A; — l)m + 2, . . . , /cm}, that 
is, ??i,m gives the letters over the first m-block, r/2,m those over the second 
m-block and so on. Also, let 0^ denote the set of possible configurations 
over the the first m-block (i.e., all possible realizations of ?/i,m). 
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2.2. Preliminary spectral estimates. We start with a spectral computa- 
tion that win be used in the arguments below. For any m < n, let /C denote 
the symmetric stochastic matrix given by the conditional probabilities 

(11) IC{a,f3) :=iy[i]i^m = P\m,rn = a], a,(3£^lm- 

In words, lC{a,f3) stands for the probability of the configuration /3 occuring 
in the first m-block, given that the configuration over the second m-block 
equals a. 

Lemma 2.1. For any m < n/2, the spectrum of IC is given by 

(12) X,:={-1)\ )[ m ) ' fc = 0'---'-- 
Proof. Let Tim denote the space of all functions ip : Q.m — > M and write 

(13) ((^, i^) = —.Yl f{m,ni)ip{m,ni) = j — - '^(«)V'(a) 

n! n\ ^ 

for the scalar product of G Tim- We write Idp for the function 

lCip{a)= J2 IC{a,(3)ip{P). 

By the definition of IC{a,(3), IC(p does not depend on the order of the m 
letters, for any if G TCm- If Ti-m denotes the linear subspace of functions ip G 
Tim that are independent of the order of the m letters, then any eigenvector 
of IC must be in Tim- Clearly, constant functions give the eigenvalue Aq = 1. 

We say that letter j belongs to q G (and write j G a) if the letter 
labeled j appears in the configuration a. For any j = 1, . . . ,n, we define 
Xj to be the indicator function of the event {j £ a}. To determine the 
spectrum, we may proceed as follows. Consider distinct letters ji,...,jk, 
where 1 < k < m. If at least one of the k letters belongs to a, then clearly 
^[Xji ■ ■ ■Xjfc](Q^) = 0- If none of them belong to a, the probability of hav- 
ing the given k letters in rji^m, given that r/2,m. = a, is K,[Xji '"Xjk]i'^) — 
(n — m — k) m — k/ {n — m)m. Therefore, defining Afc as in (12), we have 

(14) ^X,vX,. = |Afc|(l-x,J---(l-X,J. 

For k G {l,...,m}, let lik denote the collection of functions of the form 
(/? = Xji ■ ■ ■ Xjk ' with k distinct letters. Also, let Ak denote the linear span of 

{Uq,Ui^ ■ ■ ■ Mk], 

with Uq denoting the constant function (p = 1. Observe that, by (14), the 
subspaces Ak are invariant for /C, that is, K,Ak C Ak - Moreover, Am coincides 
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with Tim- Since /C is self-adjoint w.r.t. (•,•), to establish (12) it is sufficient 
to prove that if ip £ Ak and ip is orthogonal to Ak-i, then 

(15) {ip,ICp) = Xk{(p,ip). 

To prove (15), observe that ip G Ak must be of the form ip = (j) + with 
(f) G Ak^i and ip a linear combination of functions in U^^ say ^ = ^iO-iipi- 
For each ipi^Uk, we use (14) to obtain ICipi = + Xki^i, with (pi G Ak-i- 
Setting cp = J2i Cii4>i-: we see that K,ip = + A^V- If ^ is orthogonal to Ak-i, 
we then have {ip,lC(p) = {ip,(j)) = 0. In particular, 

{ip,JCip) = Xk{ip,'ip) = Xk{^,ip). □ 



We now turn to a spectral gap estimate for an auxiliary block-dynamics. 
Suppose m < n/N for some integer N >2 and let Ii,...,Ij\f denote 
nonover lapping m-blocks in Vn- Consider the Markov chain which picks uni- 
formly at random one of the N blocks and, given the configuration on that 
block, chooses uniformly at random a compatible configuration on the com- 
plement of that block. In symbols, we are looking at the symmetric transition 
matrix 



(16) 



1 



N 



k=l 



The next result is a natural extension of the estimates in Theorem 2.1 of [3] 
and Lemma 2.2 of [2]. 



Proposition 2.2. Let ^ denote the lowest nonzero eigenvalue ofl — V. 
Then, 



(17) 



N-2 
N-1' 



Proof. For all functions f,g:Q^M., we denote by (/, g) the scalar 
product 



(18) 



1 



(/,5) = W/ff] = -E/(^M^)- 



Let TTfc :0 — > denote the projection r/ — > tji^, the restriction to the con- 
figuration on the block I^. We define the subspace F of sums of mean-zero 
functions of a single block: 



(19) 



[ k=l 



with {gk, 1) = Vfc 
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with (•, •) as defined in (13). We write V = ■^J2k=i'Pk, with 

In this way, 

coincides with the conditional expectation of /, given the configuration in 
the block J^, and is a function of the letters on the single block only. 
Therefore, Vf £ T for every / with z^[/] = 0. Since V is self-adjoint w.r.t. 
(■) 'Pf = whenever = 0, and / is orthogonal to F. To prove (17), it 
is then sufficient to establish 

(20) {f,{i-V)f)>^^{f,f), fer. 

Given / £ F, / = J2k9k ° '^k, we define iff = J2k9k^ a function on Qm. If fC 
is defined by (11), we have 

We now compute 

(/, f) = ^{9kOTrk,gj oTTj) 



(21) 



J2 5Z {9k,ICgj) + ^{gk,gk) 

k j-jy^k k 

J2{9k,ICipf) -J2i9k,ICgk)+J2i9k,gk) 
{if f ^Kif f) + Y^{gk,{l - K:)gk) ■ 



Similarly, observing that 



we compute, for every /c, 

{f,'Pkf) =^{giO '^i,'Pk{gj o VTj)) 



= Y1 J2 (9i° TTi, (JCgj) O TTfc) +J2(9i° T^u9k O VTfc) 
i j-.j^k i 

(22) 

= XI 51 ^Si,^ 9j) + i9k,ICgj) + {gi,}Cgk) + {gk,gk) 

i:i^kj:jytk j-j^'k i:iy^k 

= ((^/,/CV/) + 2(^/,/C(l - IC)gk) + {gk, (1 - ICfgk). 
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Averaging over k and rearranging terms, we finally have 
(23) 



{f,a-r)f) = ^{vf,ic{i-ic)ipf) 



1 



+ ^J2i9k,a-mN-l)+JC)g,). 



N 
ft 

Note that (21) and (23) allow us to reduce the assertion (20) to spectral 
estimates involving only the operator /C. These, in turn, will follow from 
Lemma 2.1. 

Consider the subspace 5 C F of symmetric functions. 



(24) 



S= |/ = go vTfc; 5:0m with {g,l) =o|. 



Since S is invariant for V, that is, VS C S, to prove (20) we may consider 
separately the cases f £ S and f -L S. When f G S, that is, / = J2k9 ° '^k, 
we have ff = Ng, and, rearranging terms in (21) and (23), we obtain 

(25) {f,f) = NiN-l)(^g,^- ^ 



(26) 



{f,il-V)f) = iN-lf{g,[l-^ 



N-1 



1 



N-1 



Since m < n/2, it is easily seen from (12) that |Afc+i| < |Afc|, for any k. More- 
over, since Nm < n, we have |Ai| < 1/{N — 1), so that K. + is nonnega- 

tive, and we may define its square root [fC + 77^]^^^- 5 = + 77^] ^^^5' 
so that {f,f) = NiN-l){g,g). Observe that {g,l) = {g, [IC + = 

(since [IC + jf^]^^'^ is self-adjoint and g is orthogonal to constants). There- 
fore, 

{g,}Cg) < X2{g,g), 

since Ai is negative and A2 is the largest positive eigenvalue (other than 
Ao = l). 

From (25) and (26), we then have 

{f,{l-V)f) = {N-l)'{g,{l-JC)g) 

(27) >{N-l)\l-X2){g,~9) 

N -I 

-(1-A2)(/,/). 



N 

From (12), we have A2 = m(m — l)/{n — m){n — m — 1). In particular, for 
any N >2 and m > 1 with Nm < n, 

(28) A2 < ^ 



{N-l 



\2- 
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N-l n \ \ ^ iV-2 



It follows that - A2) > fef • From (27), we conclude that 

{f,{i-r)f)>^{f,f), 

which proves the claim for / G 5. 

It remains to study the case / G S^. Let u be a generic element of S, that 
is, u = J2k ^0 ° TT/c, for some uq with {uq, 1) = 0. Computing as in (21), one 
has 

1 



(/,n) = (iV-l) 



N-l 



ipf,Uo 



Since uq is an arbitrary mean-zero function, requiring / G S-^ [i.e. (/, u) =0 
for all u& S] implies that [JC + jfzi]'Pf is a constant. It follows that 



N-l 



1 



N-l 



N 



1 



1 



N-l 



N- 



where we use the fact that 1) =0 (recall that each gj has mean zero by 
assumption). In particular, this shows that 



(29) 



The above identity says that the first term in the r.h.s. of (23) equals (iV — 2)/ 
(iV — 1) times the first term in the r.h.s. of (21). 

Let us now look at the second term in the r.h.s. of (23). If we define the 
functions = (1 — ICy^^gk, we have 

(30) ^ E i9k, (1 - mN - 1) + }C)g,) = ^Y.(9k, [{N - 1) + m)- 



N 



k k 

Using /C > Ai > —1/{N — 1), we obtain that (30) is estimated from below by 



(31) 



N -2 



N-2 



N- 



This is precisely (A^ — 2)/{N — 1) times the second term in the r.h.s. of (21) 
In conclusion, (29) and (31) show that 



(32) 



AT — 2 

if,il-r)f)>j^if,f), feS^. 



From (27) and (32), we obtain the claim (17), and the proof is complete. 
□ 
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2.3. The block-average dynamics. Let i and N be given integers and 
suppose that we have exactly n = Ni vertices. We partition the vertex set 
Vn by means of nonover lapping ^-blocks Ii/,. . . , In,i- There are i choices 
for such partitioning, but, by symmetry, our estimates will not depend on 
the choice. For the sake of definiteness, we take the ^-blocks Ik/ = {{k — 
l)l + l,{k-l)l + 2,...,k^}. 

The block-average dynamics is the continuous time Markov chain obtained 
as follows. There is an independent rate-1 Poisson clock at each block. When 
block li/ rings, a further block Ij/ is chosen uniformly at random (with 
replacement, i.e. the choice may be li/ itself). If i = j, we do nothing. If 
^ 7^ J i we choose uniformly the new configuration on li i U Ij/ among all 
configurations compatible with the letters outside the blocks li/ and Ij/. 

The above defined process is reversible with respect to the uniform prob- 
ability z^, and its Dirichlet form can be written as 

, N N 

(33) W/) = ^EE'^[(A./)']' 

i=l j=l 

where the average gradient is given by 

(34) Aijfia) = E(/(e) - fi<T)Hr] = ek^ry = vrfcciV fc / 

(en 

for i 7^ j, and where we agree that Ai^if = 0. Also, note that 

where Varjj stands for the variance w.r.t. i/[-|7rfcr/V A; 7^ i, j], the conditional 
probability obtained by freezing the configuration in all blocks Ik/, k 7^ 

Proposition 2.3. Let 'y{N,i) denote the relaxation time of the block- 
average dynamics. Then, 

(35) l{N,i) = l, 
for every £>l,N>2,n = Ni. 

Proof. We first observe that 7(2, £) = 1 for all £> I. Indeed, when 
AT = 2, we have ^1,2/ = ^2,1/ = i^[f] - f, so that V{f, f) = E[{f - v[f]f] = 
Var(/). 

We turn to the case > 3. Using the notation (18) of Proposition 2.2, 
we write, for any / : — > M, 

1 ^ 

(36) U.{^-V)f) = ^Y.UJ-V,f). 
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Note that Vjf{r]) coincides with the function r/ — > i/[/|7rj?7], the conditional 
expectation of /, given the configuration on /j £. In particular, denoting by 
Varj(/) the variance of / w.r.t. i/[-|7rjr/], we have 

(/, / - r,f) = {f- Vjf, f - V,f) = i^[Var, (/)]. 

Once the i variables in Ij £ are frozen, the measure z/[-|7rjr/] is the uniform 
measure on all {n — 1)1 permutations of n — £ letters over n — i vertices. For 
any 77 G O, we may therefore estimate, by definition of 'y{N,i), 

Var,(/) < ^^^^^EE-[(A../)V,^]. 

Taking i/-expectation, we then have 

Averaging over j and observing that 

E E E ^HAklf] = N{N - 2)V{f, /), 

we obtain 

1 ^ N -1 

(37) ^ E(/' / - ^.Z) < 7(A^ - /)• 

If (/,!) = 0, we know by Proposition 2.2 that (/, (1 - P)/) > 
From (36) and (37), this implies 

(/,/)<7(A^-l,W/,/)- 
In conclusion, we have shown that 

(38) 7(7V,^) <7(Af- 1,£), Af>3. 

This shows that 7(iV, i) < 7(2, £) = 1 for any > 2, £ > 1. 

To prove a lower bound on ^{Nji), let / denote the indicator function of 
the event that letter 1 belongs to block Ii^£. Clearly, the expectation of / 
is jj, and Var(/) = ^j^- On the other hand, one can easily check that, for 
every j / 1, one has i>{{Aijf)'^] = jj^. Therefore, for this function, 

^(/'/) = | E ^[(^i,,/)'] = ^ = Var(/), 
which implies 'y{N,i) > 1. □ 
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2.4. Extensions of the block-average dynamics. Here we extend the dy- 
namics defined by (33) to the case where n is not a multiple of I. We thus 
consider the case n = Nl + m, 1 < m < i — 1. We say that a collection 
D = {Ij} of subsets of Vn is an ^-partition if: 

(i) Vn = Uj Ij and Ij n 4 = for j ^ k, 

(ii) all the /j's but one are ^-blocks, 

(iii) the remaining Ij is an m-block. 

Any ^-partition is therefore made of N ^-blocks (for some integer N) and 
one m-block. An ^-partition is said to be of type 1 if vertex x = 1 is the 
left endpoint of one of the blocks Ij. We use symbols Ij^i, j = 1, . . . , N , for 
the ^-blocks and lN+i,m for the m-block. Moreover, there are exactly 1 
^-partitions of type 1, depending on the position of lN+i,m relative to the 
Ij^i, and we denote by the ^-partition of type 1 for which Ip^^i^rn is the 
kth. block in the partition starting from vertex 1. The elements of Dj. are 
denoted by /jj and (A: = 1, . . . , + 1). 

Let 'Cfe(/, /) denote the Dirichlet form 

N N 

(39) V,{fJ) = -Y^Y.''[{Af;^ff], 

i=ij=i 

(k) / \ (k) 

where the average gradients A- j are defined by (34), with Ij/ in place of 

Ij^£. Since the m-block /^^^ ^ is never updated, the Dirichlet form (39) is 
nonergodic. However, the average over k of T>k is ergodic, and we have 

Proposition 2.4. For any i>l, N >2 and m < £ — 1 such that n = 
Ni + m, for every function / : ^ M, 

(40) Var(/)<|P(/,/), 
where 

-, N+l 

(41) V{fJ):=-—Y,V,{f,f). 

k=i 

Proof. For A; = 1, . . . , A^ + 1, let Jk denote the m-block ^ of the 
partition Dk. Also, let vr^ denote the projection rj — > rjj^. Note that, by con- 
struction, the m-blocks J/, have no overlap. We may then apply Proposition 
2.2, defining V as in (16), with Ik := Jk and N replaced by A + 1 (for every 
A > 2, we have here A + 1 blocks of length m). We then see that, for any 

/, 

(42) Var(/) < ^(/, (1 - V)f) < ^(/, (1 - V)f). 
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We now observe, as in the proof of Proposition 2.3, that for each k, the 
expression (/, (1 — Vk)f) can be written as z^[Varfc(/)], where Varfc(/) stands 
for the variance of / w.r.t. i/(-\7r'^r]), the probabihty obtained by freezing the 
m-block Jfc. Clearly, for each rj, the measure i^(-|vr^?7) is uniform over the 
configurations oi n — m = Ni given letters. For each k and rj, we may then 

(k) 

apply Proposition 2.3 to the system of N blocks Ij/, j = l,---,-^, and 
estimate 

, N N 

(43) Varfc(/) < ^ E E ^(4^?/) Vfc^]- 

i=l j=l 

Taking i/-expectation and averaging over k, we therefore obtain 

(44) (/, (1 - V)f) < —— E W, /)• 

k=i 

The proof is complete. □ 

2.5. Comparison estimates. The first comparison allows us to progress 
from the block-average dynamics to an intermediate block dynamics, the 
moves of which are (1) averaging of adjacent blocks and (2) exchanging of 
adjacent blocks. The exchange moves are speeded up by a factor N'^. We 
refer to this intermediate process as the local average-exchange dynamics. 

The setting is as in Proposition 2.4 above, with defined by (39). Given 

(k) (k) 

the type 1 ^-partition Dk , we agree to rename the ^-blocks /j ^ so that ^ 
comes after the m-block l'^} after I'^J and so on. The exchange 

gradients E^^^-^ appearing in the statement below are defined by 

(45) 4^^J(r?)=/(r?(^'^+i)) -/(,?), 

where, for any i,j, r/^*'-'^ denotes the configuration r] after the interchange of 
block I^f with Ij^i, that is, T^kV^^'''^ — ^kV for all k ^ while vrj/y^*'-'^ = TTj/y 
and TTir]^^'^^ = TTjrj. We agree that 



Lemma 2.5. For every function / : ^ M, for every k = 1, . . . ,N + 1, 

N~l N-l 

(46) Vkifj) < 3N' E W(<-li/)'] + h E '^[i4:liff]- 

1=1 i=l 

Proof. Let Tij : O — > J7 denote the transformation r/ — > rj^^'^h Then, as- 
suming i < j, we see that 

i,j — -ti,j+l-t«+l,i+2 • • • -t j-2,j-l-' j-l.jJ j-2,j-l • • • j+l,i+2-ii,'t+l- 
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Define H = j — i and set 

Uh = Ti^h-l,i+h ■ ■ •Ti+l,i+27i,i+l) h = l,...,H. 

Also, set 

Sh = Tj_h-i,j-h ■ ■ ■ Tj-2,j-iUH-, h = I, . . . ,H - 1. 
In this way, setting Sq = Uh and C/q = 1, we write 

= f{SH-iv) - fiUHV) + fiUHV) - fiv) 

H-1 

= lfiTj~h~i,j~hSh~iv) - f{Sh~iv)] 

h=l 

H 

h=l 

= A{f)+B{f), say. 

We then estimate 

z.[(Eg)/f]<2^P(/)n + 2z.[(i?(/))2]. 
By Schwarz's inequahty, 

H-l 

h=i 

H-l 

= (/?-l)^z.[(/oT,„,_i,,„,-/)2] 

h=l 

N-1 
i=l 

where we have used the invariance of v under all transformations S^- The 
same reasoning gives the same estimate for z^[(i3(/))^]. This shows that, for 
each couple we obtain the following estimate for the exchange terms: 

(47) u[{E^,'^ff]<ANY.u[{Ei%jf]. 

i=l 

We now have to estimate the average terms. To this end, observe that, if the 
symbol iOij{rj) is used to denote the configuration r] outside the two blocks 
I^f and we may regard Af'j f as the function Af'j f{r]) = u[f\uJij{r])] — 
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(k) 

f. Now a simple change of variables shows that for any other block, If^^, 

where, as usual, r]^^'^') stands for the configuration in which the blocks 
Ij^£ ,If^j have been exchanged and f^-''^\r]) = /{r]^^'^"^)- We then have, using 
the invariance of under the exchange rj — > r]^^'^\ 

= z.[(i.[/(^»|a;,,,(r?)]-/(^»)2] 

Moreover, 

and Jensen's inequality implies 

^[(^W(/0>)_/))2]<^[(/0>)_/)2]. 

The last expression is nothing but z^fC-E-j^/)^], and we can estimate it 
by (47). Summarizing, choosing, for example, h= j + 1 we have obtained 

(48) W(45V)'] < 2^(^(5+1/)'] + 8iV ^ W(4'li/)']- 

(=1 

Recalling the expression (39) for the Dirichlet form 'Cfc(/)/)) the bounds 
(47) and (48) imply the desired assertion (46). □ 

The next task is to rewrite the exchange and average contributions to 
the local block dynamics appearing in the r.h.s. of (46) in terms of the 
inversions t] 77^'^. We start by writing each exchange term in terms of 
three inversions. To this end, we denote by z the left end vertex of Ii^£ , so 

(k) 

that 2; + ^ is the left end vertex of ^ . 

Lemma 2.6. For every function / : $7 ^ M, for every k = 1, . . . ,N + 1, 

(49) i^[{eI%J)'] < M{Rz,2i-iff + (Rz+i/^iff + iRz,i-iff]. 

Proof. Denote by W^^h : ^2 ^ 17 the inversion transformation r/ — > rj^'^. 
It is clear that Wz,2£-iWz+e/-iWz/-i'r] coincides with 7/^*'*+^^ Therefore, 

eI%J{v) = Rz,2i^if{Wz+e/-iWz,i-iv) 

+ Rz+e,e~if{Wz,e-iq) + i?.,,_i/(r/). 
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The conclusion now follows from Schwarz's inequality and the invariance of 
V under all transformations W^^h- D 

Our last comparison shows how to bound each average term in (46) by 
means of inversions. 

Lemma 2.7. For every function / : $7 ^ M, for every A; = 1, . . . , + 1, 

(50) y\i^Afl^ff\<^ E^[(^->//)' + («-+i>'-i/)']- 

Proof. The starting point is the following standard spectral gap esti- 
mate for random transpositions. Consider a system with d vertices and d 
letters. Let V^,}/ denote the gradient associated to the transposition of the 
letters at vertices x and y, that is, Vx,yf{ri) = fiv^'^) — /{rj), with ry^''^ de- 
noting the configuration identical to rj out of x,y and such that {rj^''^)x = f]y 
and {'r]^'^)y = r]x- It is then well known (see, e.g.. Theorem 5.1 in [2] for a 
simple proof) that, for all d > 2 and all functions /, 

(51) Var,(/)<^Ei/,[(V,,,/)2], 

where is the uniform measure over the d\ permutations, and Var^ denotes 
the variance w.r.t. I'd- 

We want to apply this bound to the system with d = 2i which is obtained 

(k) (k) 

by freezing all the letters in the complement of U I^^^ . Recall the notation 

~ ^[f I ^iA+iiv)] ~ f th^t was used in the proof of Lemma 2.5. 
We then have, for each rj and every / : $7 ^ M, 

(52) Var(/ I a;i,,+i(7?)) < ^ ^ u[{Vx,yf? \ u:i,^+l{v)]. 

x,yGl} Ui I , 
I i+i 

Clearly, each transposition is written as the composition of two inversions: if, 
for example, y = x + h, then Vx,x+hf{'n) = Rx,hf{v''^^'''^^~^)+Rx+i,h-if{v)- 
Recalling that z^[(4?+i/(^))^] = i^[Var(/ | 

^iA+i{'n))]j the bound (50) is seen 
to be a straightforward consequence of (52). □ 

2.6. Proof of the upper bound in Theorem 1.1. We now combine the 
spectral gap estimate of Proposition 2.4 with the three comparison lemmas 
above. Recall that we are looking for a bound of the form r(n, L) < C{n V 
f^), that is, 

(53) Var(/)<c(^nV J)^:(/,/), 
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for an arbitrary function /, with £{f,f) given by 

(54) £:(/,/) = li- Y: E'^[(^-//)']- 

"^"^-^ xev,,e<L 

For every 1 < £ < L, we use Proposition 2.4 and Lemma 2.5 to write 

(55) Var(/) < CN^Exif) + CAv(/), 
with the exchange and average terms given by 



1 



N+lN-l 



(56) Ex(/) := -— Y: E '^iKiJn 

k=i i=i 

Af+lAf-l 

(57) Av(/) := ^ E E '^[(A^'ii/)']- 

fc=l i=l 

We start by estimating the average term Av(/). Using Lemma 2.7 and ob- 
serving that for every x £ Vn, on any given partition the number of 

(k) (k) 

indices i such that x £ U I^^^ is at most 2, we have 

Av(/) < 7 E E ^m^,iff + iRx+i,i~iff] 

x£Vn l<2£ 

<7E E-[(^^.^/)']- 

x€V„ i<2e 

Therefore, for any S > and £ <n such that 6L < £ < ^L, we have 

(58) Av(/)<8rin£:(/,/). 

Summarizing, taking, for example, 6 = 0.1 and adjusting the value of the 
constant C in (55), we have obtained that if SL < £ < ^L, then, for every /, 

(59) Var(/) < CN^ Ex(/) + Cnf (/, /). 

We now turn to an estimate of the exchange terms (56). Observe that, 
so far, only partitions of type 1 enter the definition of Ex(/); see (56). 
However, by symmetry, we could just as well use, for any y E Vn, partitions 
of type y defined as those ^-partitions {Ij} of Vn for which the vertex y is 
the left endpoint of one of the blocks Ij. Again, there are exactly N + 1 

(k v) 

^-partitions of type y, say D^^y, k = l, . . . ,N + 1. Let us denote by /j ^ the 

(k v) 

corresponding ^-blocks and define the associated exchange gradients El j 

as in (45) and (34) with /j^'^^ in place of Ij,£. We may then replace Ex(/) 
in (59) by the following average over partitions of type y, for y = 1, . . . ,£: 



^ e N+iN-i 



(60) 7 E Ex.(/), Ex,(/) := ^ E E K(4ti/) ] 
y=l k=l 1=1 
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Now, each term Exj^(/) is estimated by means of Lemma 2.6. Moreover, it 
is easy to check that for every x £Vn and k = 1,...,A^ + 1, there is only 
one vertex y G {!,...,£} such that x is the left endpoint of one of the /j^'^^ 
(j = 1, . . . , A^) . It follows that 

y=l X&/n 

We want to average this expression over bL < i <^L. To this end, choose 
5 = 0.1 and Lq such that the number of integers i satisfying 6L < £ < is 
greater than L/4 for every L> Lq. If L > Lq , we can therefore estimate 

1 \ - „ , „^ „n 



#U:6L<£<(1/2)L} ^ ^ i L 

— — \ / ' 1 SL<i<(l/2)L y=l 

for some universal constant C. Going back to (59), we first replace Ex(/) 
by (60) and then take the average over SL < £ < ^L. In this way, recalling 
that = [n/£] <n/ {6L) we obtain, for L> Lq and a new universal constant 
C, 



(61) Var(/) < C-pf (/, /) + Cn£{f, f) < 2C (^n V -^jSif, /)• 

It remains to consider the case L < Lq. In this case, it is sufficient to prove 
that r(?i,L) < Cn^, for some universal constant C. Recall that, when L = 1, 
T{n,l) <Cn^, by the well-known diffusive bound for local transpositions 
(see, e.g., [2]). On the other hand, the obvious bound 

shows that T(n,L) < Lot(?i, 1), for any L < Lq. This concludes the proof of 
the upper bound in Theorem 1.1. 



2.7. Proof of the upper bound in Theorem 1.2. This is a straightforward 
consequence of the previous bounds on r(n, L). Indeed, there is some Ci < oo 
such that 0^-1 > 1/Ci for any ^ < (1 - 6')~^ Oe (0, 1). Therefore, choosing 
an integer L < (1 — 9)~^ with < 6*2(1 — 9) and removing all inversions 
with £> L, we have 



T{n,e) < CiC2T{n,L) < C7iC|(n V [n(l - 0)]^). 
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3. Lower bounds. Here we are going to find the appropriate slow modes 

3 

of relaxation. As already discussed in [6, 8], the bound 0(n V ^) corre- 
sponds to the competition of two different phenomena: the diffusive scale 
0{jj) is related to the transport of local information, while the scale 0(n) 
originates from the difficulty of separating two adjacent letters during the 
time evolution (described in terms of conserved edges in [6]). For the sake 
of completeness, below we explicitly derive these bounds, although some of 
them appear already (in some cases with explicit constants) in [6]. 

3.1. Proof of the lower bound in Theorem 1.1. Let the n letters be la- 
beled by the integers from 1 to n and denote by the indicator function 
of the event {letter n is at vertex x}. Let 5 : [0, 1] — > M be a smooth function 
with Jq g{t) dt = 0, /q g{tY dt = l. Set 

(62) ^p{r]) = ^ g{x/n)C.. 

As n — > 00, we have — > and — > 1. We may therefore estimate 
Var(^) > ^, if n is large enough. The lower estimate T{n,L) > is then 
a consequence of 

Lemma 3.1. There exists C < 00 such that 

(63) £^^p,^)<c^ C g'itfdt. 

n-^ Jo 

Proof. Write I^/ for the {£ + l)-block with left end x and right end 
X + i. Observe that, for any rr E V„ and i< L, we have 

E i^f - ^y) = 0- 

It follows that 

Rx,ii^= E ig{y/n)-gix/n)){cf-^y). 

We now expand g{y/n) — g{x/n) = g'{x/n){y — x) jn + 0(/^ jr?). Observing 
that 

E -C.)']<2(^ + l)/n, 

y^ix,t 

and ignoring higher-order terms, we have 
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The proof is then completed by averaging over x £Vn and i< L. □ 

To prove the bound T{n,L) > n/C, we may proceed as follows. Let x 
denote the indicator function of the event {letter 1 is adjacent to letter 
2}. It is easily seen that, for every n > 3, the probability of this event is 
u[x] = 2/(n — 1). The variance is given by Var(x) = 4(n — 2)/{n — 1)^. The 
desired estimate then follows from 

Lemma 3.2. For every n>3, 

(64) £ix,x)< 



n(n — 1) 



Proof. Observe that {Rx/xf = x(l — X^'^) + — x)- Let us analyze 
the contribution of x(l ~ X^'^) (the other term contributes the same by 
symmetry). If 1,2 are adjacent in r/, but not in t/^'^, this implies that either 
Tjx € {1,2} or Tjx+i € {1,2}. Therefore, for every £, 

E X(l-X"'')<4x. 
x&V„ 

Taking expectations and dividing by n then gives the desired result. □ 

3.2. Proof of the lower hound in Theorem 1.1. For the bound T{n,9) > 
[n(l — 9)]^/C, we simply follow the computation in Lemma 3.1 to obtain 

The required bound now follows from the fact that (1 — 0)^J2i<n^^~^'^^ 
uniformly bounded in G (0, 1). Finally, the bound r(n, 6) > n/C is straight- 
forward, since Lemma 3.2 still holds when S{XtX) is replaced by £g{x,x)- 

3.3. A lower bound on the log-Sobolev constant. Here we prove the es- 
timate (10). Let n be even and denote by the indicator function of the 
event 

{the first n/2 letters occupy the first n/2 vertices}. 
Then, :,[^] = Q^y^ and 

(65) Enm = -i.[^]\ogm>-m- 

On the other hand, only inversions r] — > 77^'^ with x £ {n — L, . . . ,n}U {n/2 — 
L, . . . ,n/2} can affect the value of In such cases, we have i^[iRx,iC?] < 
2i'[(^] for any £ < L. It follows that 

(66) £{V~^,v~o=£i^,o<—m- 

n 

Combining (65) and (66), we arrive at the desired estimate. 
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